PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(51) International Patent Classification 6:

C12N 15/70, 15/62, 9/24, 1/21, A61K
38/47, C12Q 1/34 // (C12N 1/21, C12R
1:19)



A1



(11) International Publication Number: 
(43) International Publication Date: 



WO 97/02351 

23 January 1997(23.01.97) 



(21) International Application Number: PCT/GB96/01577

(22) International Filing Date: 1 July 1996 (01.07.96) 



(30) Priority Data: 
9513683.4 



5 July 1995 (05.07.95) 



GB 



(71) Applicants (for all designated States except US): CIBA-GEIGY
AG [CH/CH]; Klybeckstrasse 141, CH-4002 Basle (CH).
CAMBRIDGE UNIVERSITY TECHNICAL SERVICES
LIMITED [GB/GB]; The Old Schools, Trinity Lane, Cambridge
CB2 1TS (GB).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): TAYLOR, Peter, William
[GB/GB]; Marringdean Oak, Marringdean Road, Billingshurst,
West Sussex RH14 9HF (GB). LUZIO, John, Paul
[GB/GB]; Pippin Meadow, Lowfields, Little Eversden,
Cambridge CB3 7HJ (GB). BRYANT, Jonathan, Marie
[GB/GB]; 23 Monmouth Street, Topsham, Exeter EX3 0AJ
(GB).

(74) Agent: SHARMAN, Thomas; Ciba-Geigy PLC, Patent Dept. 
Hulley Road, Macclesfield SK10 2NX (GB). 



(81) Designated States: AL, AU, BB, BG, BR, CA, CN, CZ, EE,
GE, HU, IL, IS, JP, KP, KR, LK, LR, LT, LV, MG, MK,
MN, MX, NO, NZ, PL, RO, SG, SI, SK, TR, TT, UA,
US, UZ, VN, ARIPO patent (KE, LS, MW, SD, SZ, UG),
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM),
European patent (AT, BE, CH, DE, DK, ES, FI, FR, GB,
GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ,
CF, CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG).

Published 

With international search report. 



(54) Title: RECOMBINANT PROTEIN HAVING BACTERIOPHAGE ENDOSIALIDASE ENZYMATIC ACTIVITY
(57) Abstract 

A recombinant protein having bacteriophage endosialidase enzymatic activity obtainable by expression from a recombinant vector 
comprising a DNA sequence encoding a bacteriophage endosialidase linked to a DNA sequence of an expression vector which expresses a 
polypeptide which adds to the N-terminus of the endosialidase, or an analogue of said protein, which is a mutant, functional fragment or 
derivative of said protein having endosialidase enzymatic activity.



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM  Armenia
AT  Austria
AU  Australia
BB  Barbados
BE  Belgium
BF  Burkina Faso
BG  Bulgaria
BJ  Benin
BR  Brazil
BY  Belarus
CA  Canada
CF  Central African Republic
CG  Congo
CH  Switzerland
CI  Côte d'Ivoire
CM  Cameroon
CN  China
CS  Czechoslovakia
CZ  Czech Republic
DE  Germany
DK  Denmark
EE  Estonia
ES  Spain
FI  Finland
FR  France
GA  Gabon
GB  United Kingdom
GE  Georgia
GN  Guinea
GR  Greece
HU  Hungary
IE  Ireland
IT  Italy
JP  Japan
KE  Kenya
KG  Kyrgystan
KP  Democratic People's Republic of Korea
KR  Republic of Korea
KZ  Kazakhstan
LI  Liechtenstein
LK  Sri Lanka
LR  Liberia
LT  Lithuania
LU  Luxembourg
LV  Latvia
MC  Monaco
MD  Republic of Moldova
MG  Madagascar
ML  Mali
MN  Mongolia
MR  Mauritania
MW  Malawi
MX  Mexico
NE  Niger
NL  Netherlands
NO  Norway
NZ  New Zealand
PL  Poland
PT  Portugal
RO  Romania
RU  Russian Federation
SD  Sudan
SE  Sweden
SG  Singapore
SI  Slovenia
SK  Slovakia
SN  Senegal
SZ  Swaziland
TD  Chad
TG  Togo
TJ  Tajikistan
TT  Trinidad and Tobago
UA  Ukraine
UG  Uganda
US  United States of America
UZ  Uzbekistan
VN  Viet Nam






RECOMBINANT PROTEIN HAVING BACTERIOPHAGE ENDOSIALIDASE ENZYMATIC
ACTIVITY

This invention relates to a recombinant protein having bacteriophage endosialidase activity, 
to a process for the production thereof and to recombinant expression systems for use in 
the production thereof. 

Bacteriophage E is a member of the PK1A-PK1E family of phages; these phages were
isolated originally from European sewage to aid in the clinical identification of Escherichia
coli K1 infections, which can result in high mortality rates in cases of neonatal meningitis.
Bacteriophage E endosialidase (K1E endosialidase) is thought to be the enzyme
responsible for initial binding to host bacteria by specifically recognising and hydrolysing the
α-2,8-linked poly-N-acetylneuraminic acid (polysialic acid/PSA) carbohydrate polymers of
the K1 glycocalyx. α-2,8-linked PSA is also expressed on the cell surface of several other
pathogenic bacteria, and various tumour cells and cell lines. It has been proposed in U.S.
Patent No. 4 695 541 that K1E endosialidase could be used in the diagnosis and therapy of
K1 meningitis, septicaemia or bacteraemia due to the enzyme's high specificity for
hydrolysing α-2,8-sialosyl linkages. PSA has been suggested as an oncodevelopmental
marker in human tumours of the kidney and neuroendocrine tissues and also may
contribute to the invasive and metastatic potential of some tumours.

In J. Bacteriol., 1993, 175, 4354-4363, there are described attempts to obtain enzymatically
active protein by expression from a DNA construct derived from the related K1F phage;
these attempts were unsuccessful.

It has now been found that protein having bacteriophage endosialidase enzymatic activity,
i.e. a protein which specifically binds to or cleaves α-2,8-polysialic acid, can be obtained by
expression from a DNA construct which is derivable from the K1E endosialidase gene and is
cloned into an expression vector which expresses a polypeptide which adds to the
N-terminus of the endosialidase sequence.

Accordingly, the present invention provides, in one aspect, a recombinant protein having
bacteriophage endosialidase enzymatic activity obtainable by expression from a
recombinant vector comprising a DNA sequence encoding a bacteriophage endosialidase
linked to a DNA sequence of an expression vector which expresses a polypeptide which
adds to the N-terminus of the endosialidase, or an analogue of said protein which is a
mutant, functional fragment or derivative of said protein having endosialidase enzymatic
activity.

The mutant may be, for example, a protein having an amino acid substituted or deleted at
one or more positions. The functional fragment may be a C- or N-terminal shortened
fragment or a fragment from within the polypeptide chain which has endosialidase
enzymatic activity. The derivative may be, for example, a pharmaceutically acceptable salt
with an acid such as hydrochloric acid, sulphuric acid, phosphoric acid, pyrophosphoric
acid, benzenesulphonic acid, p-toluenesulphonic acid, methanesulphonic acid, lactic acid,
palmic acid, tartaric acid, ascorbic acid, or citric acid; with a base, usually a
nitrogen-containing base such as sodium, potassium, magnesium or ammonium nitrogen-containing
base; or an internal salt.

In another aspect, the present invention provides a recombinant vector comprising a DNA
sequence encoding a bacteriophage endosialidase linked to a DNA sequence of an
expression vector which expresses a polypeptide which adds to the N-terminus of the
endosialidase, said recombinant vector being capable of directing expression of said
protein in a compatible host cell.

In a further aspect the present invention provides a process for the production of a protein 
having bacteriophage E endosialidase enzymatic activity which comprises culturing a host 
cell transformed with a recombinant vector as hereinbefore defined under conditions 
allowing expression of said protein and isolating the protein thereby produced. In a yet 
further aspect, the present invention provides a host cell transformed with a recombinant 
vector as hereinbefore defined. 

A preferred protein according to the invention is a protein obtainable by expression from a
recombinant vector as hereinbefore defined in which the DNA sequence encoding the
endosialidase is derived from a DNA construct encoding amino acid residues encoded by
nucleotides 172 to 1744 of the bacteriophage E endosialidase gene, i.e. nucleotides 172 to
1744 of SEQ. ID. No. 1 as hereinafter defined, or a mutant, functional fragment or derivative of
said protein which has endosialidase enzymatic activity. An especially preferred protein
according to the invention is a protein obtainable by expression from a recombinant vector
as hereinbefore defined in which the DNA sequence encoding the endosialidase is derived
from a DNA construct encoding amino acid residues encoded by nucleotides 1 to 2436 of
the bacteriophage E endosialidase gene, i.e. nucleotides 1 to 2436 of SEQ. ID. No. 1 as
hereinafter defined, or a mutant, functional fragment or derivative of said protein having
endosialidase enzymatic activity.

The protein of the invention is generally expressed in the form of a fusion protein 
comprising the endosialidase linked, directly or through a spacer, to a polypeptide derived 
from the expression vector, i.e. the vector used for expression of the protein in a suitable 
host cell, a preferred such polypeptide being glutathione S-transferase. Where it is desired 
that the polypeptide components of the fusion protein should be separable, if the fusion 
protein does not naturally contain a region which can be specifically cleaved chemically or 
enzymatically, such a region can be inserted using conventional procedures. Examples of 
selective cleaving reagents or cleaving enzymes for fusion proteins are V8 protease, 
trypsin, thrombin, factor Xa, CNBr, peptidase yscα and yscF.

In a particularly preferred embodiment of the invention, the protein of the invention is in the 
form of a fusion protein comprising bacteriophage E endosialidase linked to glutathione S- 
transferase, the fusion protein preferably having a molecular weight of about 100 kDa. 

A DNA construct, i.e. recombinant DNA molecule, suitable for the expression of a protein 
according to the invention may be an isolated DNA fragment encoding a bacteriophage 
endosialidase, for example consisting only of the coding region or prolonged by 
homologous or heterologous DNA sequences. The construct may be a DNA fragment 
encoding the endosialidase cloned into a suitable cloning vector, preferably a bacterial 
vector such as pBR317, pBR322, pUC18, pSF2124 or, especially, Bluescript SK+. Where
such a clone lacks convenient restriction sites with which to isolate solely the endosialidase
open reading frame, it may be amplified by a polymerase chain reaction (PCR) using
primers incorporating the restriction sites required.

The DNA fragment encoding the bacteriophage endosialidase may be obtained from 
genomic bacteriophage E DNA or a synthetic DNA that is substantially homologous thereto, 
i.e. is 80-100% homologous thereto. Bacteriophage E can be purified and total genomic 
DNA can be extracted using conventional procedures. The extracted DNA can then be 
digested with an appropriate restriction enzyme such as Bgl II, Eco RI, Hinc II, Hind III, Bam
HI or Pst I. The digestion product can be subjected to preparative electrophoresis with low- 
melting point agarose gel to enrich DNA fractions of a certain length in order to enrich DNA 
fragments encoding the protein of the invention. 
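
The "80-100% homologous" criterion above can be illustrated with a short sketch. The text does not say how homology is to be measured; the code below assumes plain positional identity between two pre-aligned sequences of equal length, and the function names and the 80% default threshold parameter are illustrative only.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions between two pre-aligned sequences.

    Assumption: the sequences are already aligned without gaps and have
    equal length; the source does not prescribe a homology measure.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def is_substantially_homologous(seq_a: str, seq_b: str, threshold: float = 80.0) -> bool:
    """True when identity falls in the 80-100% window quoted in the text."""
    return percent_identity(seq_a, seq_b) >= threshold


if __name__ == "__main__":
    # Hypothetical 10-mers differing at one position.
    print(percent_identity("GATCTTGGTC", "GATCTTGGTA"))
    print(is_substantially_homologous("GATCTTGGTC", "GATCTTGGTA"))
```

A real comparison of a synthetic DNA against genomic bacteriophage E DNA would first need an alignment step, which this sketch deliberately leaves out.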

When a nucleotide sequence encoding the bacteriophage endosialidase, or an amino acid 
sequence thereof, is known, DNA encoding the endosialidase can also be prepared by 
methods leading directly to the desired DNA such as conventional PCR procedures or in 
vitro chemical synthesis. 

For expression of a protein of the invention, the DNA construct is cloned into an expression
vector which expresses a polypeptide which adds to the N-terminus of the endosialidase to
give a recombinant vector according to the invention. The expression vector is, of course,
chosen according to the nature of the host cell chosen for expression of the protein.
Suitable such expression vectors are available commercially. Expression is preferably
carried out in a prokaryotic host, more preferably a microbial host, especially E. coli, when a
suitable expression vector is a prokaryotic expression vector such as a phage λ or a
bacterial plasmid. Examples of particular prokaryotic expression vectors are pGEX vectors,
e.g. pGEX-2T (Pharmacia Biotech), which result in the expression of an endosialidase -
glutathione S-transferase (GST) fusion protein, pMAL (New England Biolabs) which results
in expression of an endosialidase - maltose binding protein fusion protein, the 'PinPoint'
system from Promega which biotinylates expressed protein, the 'strep-tag' system from
Biometra which places a streptavidin binding peptide on expressed protein, the Ni-NTA
system from Qiagen which adds 6 histidines to expressed protein to bind nickel, and the
Xpress system from Invitrogen working on a similar principle to the Ni-NTA system.

Preferred expression vectors are pGEX vectors, which have a tac promoter, an internal lac
Iq gene and a thrombin or factor Xa protease recognition site, especially pGEX-2T which has
the sequence:

Leu Val Pro Arg Gly Ser Pro Gly Ile His Arg Asp

CTG GTT CCG CGT GGA TCC CCG GGA ATT CAT CGT GAC TGA CTG ACG
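
As a cross-check on the pGEX-2T sequence quoted above, the following minimal sketch translates the nucleotide line with the standard genetic code and locates the BamHI (GGATCC) and EcoRI (GAATTC) sites used later for cloning. The helper names are illustrative and not part of the source. The Leu-Val-Pro-Arg-Gly-Ser stretch is the commonly cited thrombin recognition motif; that reading is general background rather than a statement from this document.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Nucleotide sequence as printed in the text for pGEX-2T.
PGEX_2T_REGION = "CTG GTT CCG CGT GGA TCC CCG GGA ATT CAT CGT GAC TGA CTG ACG"


def clean(dna: str) -> str:
    """Strip spaces and normalise case."""
    return dna.replace(" ", "").upper()


def translate(dna: str) -> str:
    """Translate in frame 1 (one-letter codes) up to the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def find_site(dna: str, site: str) -> int:
    """0-based position of a recognition site, or -1 if absent."""
    return clean(dna).find(site)


if __name__ == "__main__":
    print(translate(PGEX_2T_REGION))
    print(find_site(PGEX_2T_REGION, "GGATCC"), find_site(PGEX_2T_REGION, "GAATTC"))
```

The translation reproduces the twelve residues printed above the nucleotide line, and both cloning sites fall inside that reading frame, which is why an insert must keep the frame set by the BamHI site.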

Cloning of the DNA construct into the expression vector to give the recombinant vector of 
the invention may be carried out using conventional restriction and ligation techniques. 
Thus, where the DNA construct contains Bam HI and EcoRI restriction sites, which may 
have been incorporated by PCR amplification, the DNA construct and the expression vector 
may be digested simultaneously with Bam HI and EcoRI and ligation effected using a DNA 
ligase in accordance with the manufacturer's instructions. 

As mentioned hereinbefore, the host cells used for expression of a protein of the invention 
are preferably prokaryotic, more preferably microbial cells, including cells of bacteria such 
as Bacillus subtilis, Pseudomonas, Streptococcus or, especially, E. coli.

Transformation of the host cells may be carried out using conventional techniques 
appropriate for those cells. Accordingly, the transformation procedure for E. coli cells 
includes, for example, Ca2+ pretreatment of the cells so as to allow DNA uptake, and
incubation with the recombinant vector. The subsequent selection of the transformed cells 
can be achieved, for example, by transferring the cells to a selective growth medium which 
allows separation of the transformed cells from the parent cells, or by restriction analysis of 
a miniprep DNA sample obtained from the incubated cells. 

The transformed host cells may be cultured by methods known in the art in a liquid medium
containing an assimilable source of carbon, e.g. a carbohydrate such as glucose or lactose,
nitrogen, e.g. an amino acid, peptide, protein or degradation product thereof such as a
peptone, ammonium salt or the like, and an inorganic salt, e.g. a sulfate, phosphate and/or
carbonate of sodium, potassium, magnesium or calcium. The medium may also contain, for
example, a growth-promoting substance, such as a trace element, for example iron, zinc,
manganese and the like.

Culturing may be effected by processes which are known in the art. The culture conditions, 
such as temperature, pH value of the medium and fermentation time, are chosen so that a 
maximum expression level of the protein of the invention is obtained. Thus, an E. coli strain 
is preferably cultured under aerobic conditions by submerged culture with shaking or stirring 
at a temperature of about 20°C to 40°C, preferably at about 37°C, and a pH value of 4 to 8, 
preferably of about 7, for about 4 to 30 hours, preferably until maximum yields of the protein 
of the invention are reached. 

The expressed protein can be extracted from microbial cells such as E. coli cells or a
supernatant of a cell culture by conventional methods, e.g. comprising lysis of the cells, 
chromatography such as ion-exchange, hydrophobic or size-exclusion chromatography, 
precipitation, e.g. with ammonium sulfate or acid, preparative electrophoresis such as 
sodium dodecyl sulphate - polyacrylamide gel electrophoresis (SDS-PAGE) or isoelectric 
focussing, and the like. When, as in especially preferred embodiments of the invention, the 
expressed protein is an endosialidase-glutathione S-transferase fusion protein, this may be 
purified by binding to glutathione beads as described by Smith and Johnson (1988) Gene 
67, 31-40. Cleavage of the purified fusion protein can be effected with thrombin, for 
example following the instructions of Pharmacia Biotech, manufacturers of the pGEX-2T 
expression vector. 

The present invention also provides a pharmaceutical composition comprising as active
ingredient a protein of the invention or a pharmaceutically acceptable salt thereof, optionally
together with a physiologically acceptable carrier, which may be, for example, an excipient,
diluent or other conventional auxiliary in pharmaceutical compositions.

Proteins of the invention may be used in the diagnosis or treatment of medical conditions,
especially of the human body, including various diseases, particularly meningitis and
cancers characterised by expression of polysialic acid on the surface of the tumour cell,
such as Wilms' kidney tumour, small cell lung carcinoma, neuroblastoma, medullary thyroid
carcinoma, urinary tract tumour, neuroectodermal tumour, teratoma, rhabdomyosarcoma,
pheochromocytoma, Ewing's sarcoma, insulinoma, breast cancer and pituitary tumour.
The proteins may be used to inhibit tumour metastasis, for example post-surgical
metastasis. The proteins may also be used in the diagnosis or treatment of other conditions
caused by E. coli K1, such as sepsis and urinary tract infections, or by other bacteria
expressing polysialic acid on the cell surface thereof.

Thus the present invention also provides a method of treating a condition caused by a
bacterium expressing polysialic acid on a cell surface thereof, cancer characterised by
expression of polysialic acid on a tumour cell surface, or tumour metastasis, which
comprises administering a protein or analogue of the invention as hereinbefore defined to a 
warm-blooded mammal in need of such treatment. 

A pharmaceutical composition of the invention, particularly for the above indications, may
be administered parenterally, for example intravenously, intracutaneously, subcutaneously
or intramuscularly. The dosage depends principally on the method of administration and on
the purpose of the treatment. Individual doses and the administration regime can best be
determined by individual judgement of a particular case of illness. Usually, a therapeutically
effective amount of a protein of the invention, when administered by injection, is from about
0.005 to about 0.1 mg/kg body weight.
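
The dose range quoted above scales linearly with body weight. The sketch below is purely arithmetic: it multiplies the stated limits of 0.005 and 0.1 mg/kg by a body weight. The function name and the 70 kg example weight are assumptions for illustration, and the result is not dosing guidance.

```python
def dose_range_mg(body_weight_kg: float,
                  low_mg_per_kg: float = 0.005,
                  high_mg_per_kg: float = 0.1) -> tuple[float, float]:
    """Return the (lower, upper) injected amount in mg for a given body weight.

    The per-kg limits default to the range stated in the text; the patent
    leaves the actual dose to individual judgement of the case.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


if __name__ == "__main__":
    # Hypothetical 70 kg adult: roughly 0.35 mg to 7 mg per injection.
    low, high = dose_range_mg(70.0)
    print(f"{low:.2f} mg to {high:.2f} mg")
```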

In addition to the active ingredient, an injectable pharmaceutical composition of the 
invention may contain a buffer, for example a phosphate buffer, sodium chloride, mannitol 
or sorbitol to adjust the isotonicity, and an antibacterially active preservative such as the
methyl or ethyl ester of p-hydroxybenzoic acid. 

The proteins of the invention, in view of their enzymatic activity, may also be used in the 
analysis of glycoproteins, for example detection and sequencing of oligosaccharide 
moieties decorating glycoproteins, since they can selectively remove particular sugar 
residues from the glycoproteins. 

The invention is illustrated by the following Examples, which relate to especially preferred 
embodiments. 

Example 1: Preparation of DNA Construct Containing Bacteriophage E Endosialidase
Open Reading Frame

Unless otherwise stated, all procedures used are as described by Sambrook et al.,
Molecular Cloning: a Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory
Press, New York (1989).

Degenerate oligonucleotide probes are designed with reference to E. coli codon usage
tables (Holm (1986) Nuc. Acids Res. 14, 3075-3087), prepared using an automated Applied
Biosystems PCR-MATE model 391 DNA synthesiser and 5' end-labelled with [γ-32P]ATP
(Amersham International plc, Amersham, Bucks, U.K.) using T4 polynucleotide kinase.
The radiolabelled oligonucleotide probes are hybridised to restriction enzyme digests of
bacteriophage E DNA, electrophoresed in agarose gels and transferred to Hybond-N nylon
membrane (Amersham International plc). Bacteriophage E DNA fragments reacting with
the probes are identified by autoradiography, purified from NA grade agarose gels
(Pharmacia Biosystems Ltd, Milton Keynes, Bucks, U.K.) and ligated into Bluescript SK+
(Stratagene Inc., La Jolla, CA, USA) using T4 DNA ligase (NEB Inc.). Transformations of E.
coli Epicurian SURE cells (Stratagene Inc.) with Bluescript SK+ are conducted according to
an electroporation method (Dower et al. (1988) Nucleic Acids Res. 16, 6127-6145) using a
Bio-Rad Gene Pulser and Pulse Controller, or alternatively high efficiency E. coli JM109
competent cells (Promega Inc., Madison, WI, USA) are transformed by heat shock at 42°C
for 60 sec. Clones transformed with recombinant plasmid are identified by growing on 2TY
/ampicillin agar plates and using a mixture of 50 mg/ml 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside
(X-Gal) and 0.1 M isopropyl β-D-thiogalactopyranoside (IPTG) to allow
blue-white colour selection of colonies. Double stranded DNA sequencing is conducted
using the Sequenase Version 2.0 sequencing kit from United States Biochemical
Corporation, Cleveland, OH, USA and a model SA sequencing apparatus from BRL Life
Technologies Inc., Gaithersburg, MD, U.S.A. Sequencing is facilitated by the technique of
nested deletions or by using synthetic oligonucleotide primers prepared by British
Bio-technology Products Ltd, Abingdon, Oxon, U.K. or as above.

A degenerate oligonucleotide probe, Probe 1 [5'-TAC(T)CAC(T)CAGGGT(G)GAC(T)GTG(T)GCG(C)CC-3'],
is derived from the cyanogen bromide fragment of K1E
endosialidase with the longest unambiguous amino acid sequence, and is the least
degenerate of five probes designed using the partial amino acid sequences obtained from
the cyanogen bromide fragments. A 1.9 kb BglII restriction digest fragment of genomic
bacteriophage E DNA is identified as potentially encoding endosialidase sequence by
Southern blot analysis using 32P-radiolabelled Probe 1. BglII and BamHI restriction
endonucleases generate cohesive protruding ends with the same sequence and this
enables the ligation of the 1.9 kb BglII fragment into the BamHI site of Bluescript SK+ cloning
vector (Promega Inc). Plasmid miniprep DNA from a clone transformed with the resultant
recombinant vector (Clone 1) yields DNA sequence which encodes a deduced protein
sequence containing a stretch of sequence identical to that of the CNBr fragment used to
design Probe 1.
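
The statement that BglII and BamHI leave identical cohesive ends can be checked with a few lines of code. The sketch below relies on the well-known recognition sites A^GATCT (BglII) and G^GATCC (BamHI), which are general background and not given in the text; the function names are illustrative. It also shows that the ligated junction is a hybrid site that neither enzyme recognises.

```python
# Recognition site and cut offset on the top strand (bases before the cut).
# These are standard enzyme properties, not values taken from this document.
ENZYMES = {
    "BglII": ("AGATCT", 1),
    "BamHI": ("GGATCC", 1),
}


def overhang(enzyme: str) -> str:
    """5' single-stranded overhang left by a palindromic site cut symmetrically."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(enzyme_a: str, enzyme_b: str) -> bool:
    """Two ends can anneal when their overhangs are identical."""
    return overhang(enzyme_a) == overhang(enzyme_b)


def hybrid_junction(left_enzyme: str, right_enzyme: str) -> str:
    """Top-strand sequence formed when a left end joins a right end."""
    left_site, left_cut = ENZYMES[left_enzyme]
    right_site, right_cut = ENZYMES[right_enzyme]
    return left_site[:left_cut] + right_site[right_cut:]


if __name__ == "__main__":
    print(overhang("BglII"), overhang("BamHI"), compatible("BglII", "BamHI"))
    print(hybrid_junction("BglII", "BamHI"))
```

Because the hybrid junction matches neither recognition site, an insert cloned this way cannot be released again with BglII or BamHI.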

Probe 2 [5'-GATCTTGGTCTAATCCCT-3'], a non-degenerate oligonucleotide 18-mer, is
synthesised using the sequence at the 5' end of Clone 1. This probe identifies one of two
SinI digest fragments of genomic bacteriophage E DNA which runs as a singlet equivalent
to about 3.3 kb. It is verified that this fragment codes for DNA sequence upstream of the 5'
end of Clone 1 by digesting the Clone 1 insert DNA with SinI. The result of this digest shows
there are at least 3 SinI sites in the Clone 1 insert DNA, the largest fragment being 1.1 kb.
Since restriction analysis of bacteriophage E DNA shows that there are only two BglII sites
in the whole genome, the gel purified SinI fragments are digested with BglII and the
fragment containing the Probe 2 recognition sequence and the BglII site yields two
fragments of 2.1 kb and 1.1 kb. The 2.1 kb SinI x BglII digest fragment is cloned into Bluescript
SK+ by ligation of the BglII end to a BamHI end, followed by end-filling using the Klenow
fragment of T4 DNA polymerase and ligating the resultant blunt ends together to circularise
the plasmid. The resultant clone (Clone 2) is found to contain an open reading frame
encoding the N-terminus of K1E endosialidase by comparison with the N-terminal amino acid
sequence of the ~76 kDa enzyme subunit. Overlapping sequence is obtained for Clones 1
and 2 in both 5' and 3' directions, and the positions of open reading frames are determined
by codon preference and positional base preferences analysis (Staden et al. (1982) Nuc.
Acids Res. 10, 141-156 and Staden (1990) Meth. Enzymol. 183, 163-180).
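
The fragment sizes reported above can be sanity-checked by confirming that the BglII digestion products add up to the parent SinI fragment within the accuracy of a gel estimate. The following sketch does only that; the 10% tolerance is an assumed figure for gel sizing error, not a value from the text.

```python
def fragments_consistent(parent_bp: int, fragments_bp: list[int],
                         tolerance: float = 0.10) -> bool:
    """True when the digest fragments sum to the parent size within tolerance.

    tolerance is a fractional error allowance (hypothetical default of 10%)
    reflecting that gel-estimated sizes are approximate.
    """
    if parent_bp <= 0:
        raise ValueError("parent size must be positive")
    return abs(sum(fragments_bp) - parent_bp) <= tolerance * parent_bp


if __name__ == "__main__":
    # ~3.3 kb SinI fragment cut by BglII into ~2.1 kb and ~1.1 kb pieces.
    print(fragments_consistent(3300, [2100, 1100]))
```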

Recombinant plasmid DNA is purified from Clone 2, linearised by cleavage of the unique
EcoRI site and 5' capped RNA is transcribed using SP6 RNA polymerase and mCAP mRNA
capping kit (Stratagene Inc.). In vitro translation reactions (25 µl) using 0.1 µg RNA
transcript, 20 µCi [35S]methionine and a rabbit reticulocyte lysate system are carried out
according to manufacturer's instructions (Promega Inc.). Confirmation that the SP6 RNA
polymerase and the in vitro translation system are functional is obtained by running a
positive control alongside. The control plasmid is a linearised SV64-carboxypeptidase E
construct with an upstream SP6 promoter region (Fricker et al. (1989) Mol. Endocrinol. 3,
666-673).

A fragment of bacteriophage E DNA of 1892 bp containing the complete Clone 1 insert is
excised from Clone 1 using EcoRI and XbaI. This is directionally cloned into the vector
pGEM-11Z (Promega Inc.) cut with the same restriction enzymes thus placing a SacI site 3' of
the Clone 1 insert. A 707 bp SacI/AvrII fragment is excised from this new construct. This
707 bp fragment encodes the predicted C-terminal 114 amino acids of the endosialidase and
the 3' untranslated region of K1E DNA. It is ligated into the 3253 bp product of a SacI/AvrII
digest of Clone 2. The resulting plasmid (Clone 3) contains only the extreme 5' and 3'
regions of the originally cloned K1E DNA in a Bluescript SK+ vector effectively lacking the
central 2975 bp of the DNA sequence which includes the sequence encoding the predicted
endosialidase open reading frame. A 2975 bp fragment derived from an AvrII digest of total
K1E DNA is ligated into Clone 3 digested with AvrII. The resulting construct in Bluescript
SK+ (Clone 4) contains the full length endosialidase gene previously encoded in Clones 1
and 2, and the gene is sequenced using the Sequenase 2.0 sequencing kit (USB Corp).
It has the sequence shown in SEQ. ID. No. 1.

Example 2: Preparation of Recombinant Plasmid for Expression of
Bacteriophage E Endosialidase

Clone 4, the DNA construct containing the complete endosialidase open reading frame
prepared as described in Example 1, is subjected to PCR using primers
5'-CCGGGGATCCATGATTCAAAGACTAGGTTCTTCATTA-3' and
3'-CGTTAGACGACGTGCGGTCTTGTGTATCTTAAGACAC-5' to facilitate amplification of
the endosialidase open reading frame with incorporation of a BamHI restriction site and an
EcoRI restriction site at the 5' and 3' termini of the open reading frame respectively.
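
The role of the two primers can be checked by searching them for the restriction sites they are said to introduce. This is a minimal sketch, with the assumption that the second primer is printed in the 3' to 5' direction: the orientation marks are unclear in the scanned text, and the EcoRI site (GAATTC) only appears when that primer is read in reverse. The variable and function names are illustrative.

```python
FORWARD_PRIMER = "CCGGGGATCCATGATTCAAAGACTAGGTTCTTCATTA"      # read 5' -> 3'
REVERSE_PRIMER_AS_PRINTED = "CGTTAGACGACGTGCGGTCTTGTGTATCTTAAGACAC"  # assumed 3' -> 5'

BAMHI_SITE = "GGATCC"
ECORI_SITE = "GAATTC"


def site_position(primer: str, site: str) -> int:
    """0-based index of a recognition site in the primer, or -1 if absent."""
    return primer.upper().find(site)


if __name__ == "__main__":
    # BamHI site sits in the 5' tail of the forward primer, directly before ATG.
    bam = site_position(FORWARD_PRIMER, BAMHI_SITE)
    print(bam, FORWARD_PRIMER[bam + len(BAMHI_SITE):bam + len(BAMHI_SITE) + 3])

    # Re-writing the second primer 5' -> 3' (string reversal) exposes GAATTC.
    reverse_5_to_3 = REVERSE_PRIMER_AS_PRINTED[::-1]
    print(site_position(REVERSE_PRIMER_AS_PRINTED, ECORI_SITE),
          site_position(reverse_5_to_3, ECORI_SITE))
```

The BamHI site is followed immediately by an ATG start codon, so the open reading frame is placed in frame with the vector sequence upstream of the BamHI site.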

The 2483 bp PCR product is cleaned by extraction first with a mixture of equal volumes of
phenol (equilibrated to pH 8.0 with 2-amino-2-hydroxymethylpropane-1,3-diol) and a 24:1
mixture of chloroform and isoamyl alcohol, then with the chloroform:isoamyl alcohol mixture
alone, followed by precipitation in ethanol and resuspension in TE buffer (10 mM
2-amino-2-hydroxymethylpropane-1,3-diol hydrochloride, 1 mM EDTA, pH 8.0). The cleaned PCR
product and pGEX-2T expression vector (Pharmacia Biotech) are digested simultaneously
with BamHI and EcoRI and purified by agarose gel electrophoresis and Qiaex extraction
(Qiagen Corp). The cut PCR product and expression vector are ligated using T4 DNA
ligase (New England Biolabs) according to the manufacturer's instructions, to form a
recombinant vector, which is sequenced, using USB Sequenase 2.0, across the two cloning
sites to verify that the correct reading frame has been maintained.

Example 3: Transformation and Expression 

The ligation product of Example 2 is used to transform electrocompetent E. coli MC1061
cells using a Bio Rad electroporation apparatus and the transformed cells are selected by 
restriction analysis of a miniprep DNA sample obtained from the cells. 

The transformed cells are cultured to express a fusion protein by the addition of IPTG to a
final concentration of 0.5 mM, when the OD600 has reached 0.3, and are then allowed to
grow for a further 4 hours at 37°C with shaking at 250 rpm. The expressed fusion protein is
purified from the culture medium by binding to glutathione beads as described by Smith and
Johnson (1988) Gene 67, 31-40.

Samples of the bacterial culture and of purified fusion protein fractions are subjected to 
SDS-PAGE, electrophoretically transferred to a nitrocellulose membrane, washed and 
hybridised to antiendosialidase polyclonal antiserum by the method of Sambrook et al, 
(1989), op. cit. Immunoreactive bands are detected by binding a second antibody
conjugated to alkaline phosphatase and reaction with (a) a 50mg/ml solution of nitroblue 
tetrazolium chloride in a 70:30 mixture of dimethylformamide and water and (b) a 50mg/ml 
solution of 5-bromo-4-chloro-3-indolyl phosphate disodium salt in water. 

The release of N-acetylneuraminic acid (NANA) from polysialic acid by purified fractions of
the fusion protein is measured using the TBA assay of Horgan (1981) Clin. Chim. Acta 116,
409-415. The measurements show that the rate of release of NANA is directly proportional to
fusion protein concentration. No release of NANA is observed when the fusion protein is
replaced by glutathione S-transferase protein alone.

SEQUENCE LISTING

(1) INFORMATION FOR SEQ. ID. NO: 1

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(ii) MOLECULE TYPE: DNA (genomic)

(iv) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteriophage E 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 .. 2436 

(D) OTHER INFORMATION: / product = "coding region for bacteriophage E 

endosialidase" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

Claims

1. A recombinant protein having bacteriophage endosialidase enzymatic activity
obtainable by expression from a recombinant vector comprising a DNA sequence encoding 
a bacteriophage endosialidase linked to a DNA sequence of an expression vector which 
expresses a polypeptide which adds to the N- terminus of the endosialidase, or an 
analogue of said protein which is a mutant, functional fragment or derivative of said protein 
having endosialidase enzymatic activity. 

2. A protein or analogue according to claim 1, in which the DNA sequence encoding the
endosialidase is derived from a DNA construct encoding amino acid residues encoded by
nucleotides 172 to 1744 of SEQ. ID. No. 1.

3. A protein or analogue according to claim 1, in which the DNA sequence encoding the
endosialidase is derived from a DNA construct encoding amino acids encoded by
nucleotides 1 to 2436 of SEQ. ID. No. 1.

4. A protein according to claim 1, 2 or 3 which is a fusion protein comprising the 
endosialidase linked directly or through a spacer to a polypeptide derived from the 
expression vector, or an analogue of said protein which is a mutant, functional fragment or 
derivative of said protein. 

5. A protein or analogue according to claim 4 in which the polypeptide is glutathione 
S-transferase. 

6. A protein or analogue according to any of claims 1 to 5, in which the DNA sequence 
encoding the endosialidase is derived from a DNA construct comprising a DNA fragment 
encoding the endosialidase cloned into a bacterial cloning vector. 

7. A protein or analogue according to claim 6, in which the DNA fragment is derived from 
genomic bacteriophage E DNA or a synthetic DNA which is substantially homologous 
thereto. 

8. A protein or analogue according to claim 6 or 7, in which the DNA construct is 
amplified by a polymerase chain reaction using primers incorporating restriction sites. 

9. A recombinant vector comprising a DNA sequence encoding a bacteriophage 
endosialidase linked to a DNA sequence of an expression vector which expresses a 
polypeptide which adds to the N-terminus of the endosialidase, said recombinant vector 
being capable of directing expression of said protein in a compatible host cell. 

10. A recombinant vector according to claim 9 in which said encoding DNA sequence is a 
DNA construct according to any of claims 6 to 8. 

11. A recombinant vector according to claim 9 or 10, in which the expression vector is a 
prokaryotic expression vector. 

12. A recombinant vector according to any of claims 9 to 11, in which the expression 
vector is a pGEX vector. 

13. A recombinant vector according to any of claims 9 to 12, in which the expression 
vector is pGEX-2T. 

14. A process for the production of a protein according to any of claims 1 to 8 which 
comprises culturing a host cell transformed with a recombinant vector according to any of 
claims 9 to 13 under conditions allowing expression of said protein, and isolating the protein 
thereby produced. 

15. A process according to claim 14, in which the host cell is a microbial cell. 

16. A process according to claim 14, in which the host cell is an E. coli cell. 

17. A host cell transformed with a recombinant vector according to any of claims 9 to 13. 

18. A host cell according to claim 17, which is a transformed microbial cell. 

19. A host cell according to claim 17, which is a transformed E. coli cell. 

20. A pharmaceutical composition comprising a protein or analogue according to any of 
claims 1 to 8, or a pharmaceutically acceptable salt thereof, optionally together with a 
physiologically acceptable carrier. 

21. A protein or analogue according to any of claims 1 to 8 for use in the diagnosis or 
treatment of a medical condition. 

22. Use of a protein or analogue according to any of claims 1 to 8 in the preparation of a 
medicament for the diagnosis or treatment of meningitis or other condition caused by E. coli 
K1 or by other bacteria expressing polysialic acid on the cell surface thereof, of cancer 
characterised by expression of polysialic acid on the surface of a tumour cell or of tumour 
metastasis. 

23. Use of a protein or analogue according to any of claims 1 to 8 in the analysis of a 
glycoprotein. 

24. A method of treating a condition caused by a bacterium expressing polysialic acid on 
a cell surface thereof, cancer characterised by expression of polysialic acid on a tumour cell 
surface, or tumour metastasis, which comprises administering a protein or analogue 
according to claim 1 to a warm-blooded mammal in need of such treatment. 